Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 1245 |
| Missing cells | 476 |
| Missing cells (%) | 1.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 233.6 KiB |
| Average record size in memory | 192.1 B |
Variable types
| Numeric | 19 |
|---|---|
| Categorical | 5 |
id.x has a high cardinality: 1035 distinct values | High cardinality |
IVS_2000 is highly correlated with IVS_REN_00 and 3 other fields | High correlation |
IVS_INF_00 is highly correlated with IVS_INF_10 | High correlation |
IVS_CPH_00 is highly correlated with IVS_CPH_10 and 1 other fields | High correlation |
IVS_REN_00 is highly correlated with IVS_2000 and 1 other fields | High correlation |
IVS_2010 is highly correlated with IVS_2000 and 1 other fields | High correlation |
IVS_INF_10 is highly correlated with IVS_2000 and 1 other fields | High correlation |
IVS_CPH_10 is highly correlated with IVS_CPH_00 and 4 other fields | High correlation |
IVS_REN_10 is highly correlated with IVS_2000 and 2 other fields | High correlation |
MASC is highly correlated with FEM and 2 other fields | High correlation |
FEM is highly correlated with MASC and 2 other fields | High correlation |
POP is highly correlated with MASC and 2 other fields | High correlation |
DOM_OCU is highly correlated with MASC and 2 other fields | High correlation |
Dens_Dom is highly correlated with Dens_hab | High correlation |
Dens_hab is highly correlated with Dens_Dom | High correlation |
Pop_Urbana is highly correlated with IVS_CPH_10 and 3 other fields | High correlation |
Pop_Rural is highly correlated with IVS_CPH_10 and 3 other fields | High correlation |
Pop_Total is highly correlated with IVS_CPH_10 and 3 other fields | High correlation |
Area is highly correlated with IVS_CPH_00 and 4 other fields | High correlation |
VTN_MED is highly correlated with NM_MU | High correlation |
NM_MU is highly correlated with VTN_MED | High correlation |
id.x has 28 (2.2%) missing values | Missing |
NM_MU has 28 (2.2%) missing values | Missing |
IVS_2000 has 28 (2.2%) missing values | Missing |
IVS_INF_00 has 28 (2.2%) missing values | Missing |
IVS_CPH_00 has 28 (2.2%) missing values | Missing |
IVS_REN_00 has 28 (2.2%) missing values | Missing |
IVS_2010 has 28 (2.2%) missing values | Missing |
IVS_INF_10 has 28 (2.2%) missing values | Missing |
IVS_CPH_10 has 28 (2.2%) missing values | Missing |
IVS_REN_10 has 28 (2.2%) missing values | Missing |
MASC has 28 (2.2%) missing values | Missing |
FEM has 28 (2.2%) missing values | Missing |
POP has 28 (2.2%) missing values | Missing |
DOM_OCU has 28 (2.2%) missing values | Missing |
Dens_Dom has 28 (2.2%) missing values | Missing |
Dens_hab has 28 (2.2%) missing values | Missing |
URB_RURAL has 28 (2.2%) missing values | Missing |
Unnamed: 0 is uniformly distributed | Uniform |
id.x is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
MASC has 39 (3.1%) zeros | Zeros |
FEM has 42 (3.4%) zeros | Zeros |
POP has 39 (3.1%) zeros | Zeros |
DOM_OCU has 39 (3.1%) zeros | Zeros |
Dens_Dom has 428 (34.4%) zeros | Zeros |
Dens_hab has 39 (3.1%) zeros | Zeros |
Reproduction
| Analysis started | 2021-02-22 14:38:24.948958 |
|---|---|
| Analysis finished | 2021-02-22 14:39:17.031780 |
| Duration | 52.08 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 1245 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 623 |
|---|---|
| Minimum | 1 |
| Maximum | 1245 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 63.2 |
| Q1 | 312 |
| median | 623 |
| Q3 | 934 |
| 95-th percentile | 1182.8 |
| Maximum | 1245 |
| Range | 1244 |
| Interquartile range (IQR) | 622 |
Descriptive statistics
| Standard deviation | 359.5448512 |
|---|---|
| Coefficient of variation (CV) | 0.5771185412 |
| Kurtosis | -1.2 |
| Mean | 623 |
| Median Absolute Deviation (MAD) | 311 |
| Skewness | 0 |
| Sum | 775635 |
| Variance | 129272.5 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1245 | 1 | 0.1% |
| 417 | 1 | 0.1% |
| 410 | 1 | 0.1% |
| 411 | 1 | 0.1% |
| 412 | 1 | 0.1% |
| 413 | 1 | 0.1% |
| 414 | 1 | 0.1% |
| 415 | 1 | 0.1% |
| 416 | 1 | 0.1% |
| 418 | 1 | 0.1% |
| Other values (1235) | 1235 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 |
| Value | Count | Frequency (%) |
| 1245 | 1 | |
| 1244 | 1 | |
| 1243 | 1 | |
| 1242 | 1 | |
| 1241 | 1 |
| Distinct | 1035 |
|---|---|
| Distinct (%) | 85.0% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Memory size | 9.9 KiB |
| F224 | 8 |
|---|---|
| F197 | 4 |
| F230 | 4 |
| F281 | 4 |
| F979 | 4 |
| Other values (1030) |
Length
| Max length | 43 |
|---|---|
| Median length | 5 |
| Mean length | 4.597370583 |
| Min length | 4 |
Characters and Unicode
| Total characters | 5595 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 878 ? |
|---|---|
| Unique (%) | 72.1% |
Sample
| 1st row | F1810 |
|---|---|
| 2nd row | F1811 |
| 3rd row | F1811 |
| 4th row | F1805 |
| 5th row | F1804 |
| Value | Count | Frequency (%) |
| F224 | 8 | 0.6% |
| F197 | 4 | 0.3% |
| F230 | 4 | 0.3% |
| F281 | 4 | 0.3% |
| F979 | 4 | 0.3% |
| F277 | 4 | 0.3% |
| F137 | 4 | 0.3% |
| F343 | 3 | 0.2% |
| F2779 | 3 | 0.2% |
| F3006 | 3 | 0.2% |
| Other values (1025) | 1176 | |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| f224 | 8 | 0.7% |
| f281 | 4 | 0.3% |
| f197 | 4 | 0.3% |
| f137 | 4 | 0.3% |
| f979 | 4 | 0.3% |
| f277 | 4 | 0.3% |
| f230 | 4 | 0.3% |
| f151 | 3 | 0.2% |
| f2779 | 3 | 0.2% |
| f2986 | 3 | 0.2% |
| Other values (1029) | 1180 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 1216 | |
| 2 | 785 | |
| 1 | 697 | |
| 3 | 493 | |
| 5 | 438 | 7.8% |
| 4 | 387 | 6.9% |
| 0 | 353 | 6.3% |
| 9 | 305 | 5.5% |
| 6 | 304 | 5.4% |
| 8 | 294 | 5.3% |
| Other values (27) | 323 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4327 | |
| Uppercase Letter | 1226 | 21.9% |
| Lowercase Letter | 30 | 0.5% |
| Control | 4 | 0.1% |
| Final Punctuation | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Space Separator | 2 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Currency Symbol | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 5 | |
| a | 5 | |
| e | 4 | |
| d | 2 | 6.7% |
| n | 2 | 6.7% |
| c | 2 | 6.7% |
| l | 2 | 6.7% |
| ƒ | 2 | 6.7% |
| s | 1 | 3.3% |
| f | 1 | 3.3% |
| Other values (4) | 4 |
| Value | Count | Frequency (%) |
| 2 | 785 | |
| 1 | 697 | |
| 3 | 493 | |
| 5 | 438 | |
| 4 | 387 | |
| 0 | 353 | |
| 9 | 305 | 7.0% |
| 6 | 304 | 7.0% |
| 8 | 294 | 6.8% |
| 7 | 271 | 6.3% |
| Value | Count | Frequency (%) |
| F | 1216 | |
| Ã | 4 | 0.3% |
| Æ | 2 | 0.2% |
| Â | 2 | 0.2% |
| R | 1 | 0.1% |
| E | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 2 | ||
| 2 |
| Value | Count | Frequency (%) |
| ’ | 2 |
| Value | Count | Frequency (%) |
| ‚ | 2 |
| Value | Count | Frequency (%) |
| § | 1 |
| Value | Count | Frequency (%) |
| £ | 1 |
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4339 | |
| Latin | 1256 | 22.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 1216 | |
| i | 5 | 0.4% |
| a | 5 | 0.4% |
| e | 4 | 0.3% |
| Ã | 4 | 0.3% |
| d | 2 | 0.2% |
| n | 2 | 0.2% |
| c | 2 | 0.2% |
| l | 2 | 0.2% |
| ƒ | 2 | 0.2% |
| Other values (10) | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 2 | 785 | |
| 1 | 697 | |
| 3 | 493 | |
| 5 | 438 | |
| 4 | 387 | |
| 0 | 353 | |
| 9 | 305 | 7.0% |
| 6 | 304 | 7.0% |
| 8 | 294 | 6.8% |
| 7 | 271 | 6.2% |
| Other values (7) | 12 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5579 | |
| None | 12 | 0.2% |
| Punctuation | 4 | 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 1216 | |
| 2 | 785 | |
| 1 | 697 | |
| 3 | 493 | |
| 5 | 438 | 7.9% |
| 4 | 387 | 6.9% |
| 0 | 353 | 6.3% |
| 9 | 305 | 5.5% |
| 6 | 304 | 5.4% |
| 8 | 294 | 5.3% |
| Other values (19) | 307 | 5.5% |
| Value | Count | Frequency (%) |
| Ã | 4 | |
| ƒ | 2 | |
| Æ | 2 | |
| Â | 2 | |
| § | 1 | 8.3% |
| £ | 1 | 8.3% |
| Value | Count | Frequency (%) |
| ’ | 2 | |
| ‚ | 2 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Memory size | 9.9 KiB |
| CARUARU | |
|---|---|
| AGRESTINA | |
| TORITAMA | |
| PANELAS | |
| TAQUARITINGA DO NORTE | |
| Other values (2) | 35 |
Length
| Max length | 21 |
|---|---|
| Median length | 7 |
| Mean length | 8.345932621 |
| Min length | 6 |
Characters and Unicode
| Total characters | 10157 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TAQUARITINGA DO NORTE |
|---|---|
| 2nd row | TAQUARITINGA DO NORTE |
| 3rd row | TAQUARITINGA DO NORTE |
| 4th row | TAQUARITINGA DO NORTE |
| 5th row | TAQUARITINGA DO NORTE |
| Value | Count | Frequency (%) |
| CARUARU | 650 | |
| AGRESTINA | 197 | 15.8% |
| TORITAMA | 167 | 13.4% |
| PANELAS | 92 | 7.4% |
| TAQUARITINGA DO NORTE | 76 | 6.1% |
| CUPIRA | 23 | 1.8% |
| QUIPAPà| 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| caruaru | 650 | |
| agrestina | 197 | 14.4% |
| toritama | 167 | 12.2% |
| panelas | 92 | 6.7% |
| do | 76 | 5.6% |
| taquaritinga | 76 | 5.6% |
| norte | 76 | 5.6% |
| cupira | 23 | 1.7% |
| quipapãƒâ | 12 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2475 | |
| R | 1839 | |
| U | 1411 | |
| T | 759 | 7.5% |
| C | 673 | 6.6% |
| I | 551 | 5.4% |
| N | 441 | 4.3% |
| E | 365 | 3.6% |
| O | 319 | 3.1% |
| S | 289 | 2.8% |
| Other values (11) | 1035 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9981 | |
| Space Separator | 152 | 1.5% |
| Lowercase Letter | 12 | 0.1% |
| Control | 12 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 2475 | |
| R | 1839 | |
| U | 1411 | |
| T | 759 | 7.6% |
| C | 673 | 6.7% |
| I | 551 | 5.5% |
| N | 441 | 4.4% |
| E | 365 | 3.7% |
| O | 319 | 3.2% |
| S | 289 | 2.9% |
| Other values (8) | 859 | 8.6% |
| Value | Count | Frequency (%) |
| 152 |
| Value | Count | Frequency (%) |
| ƒ | 12 |
| Value | Count | Frequency (%) |
| | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9993 | |
| Common | 164 | 1.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 2475 | |
| R | 1839 | |
| U | 1411 | |
| T | 759 | 7.6% |
| C | 673 | 6.7% |
| I | 551 | 5.5% |
| N | 441 | 4.4% |
| E | 365 | 3.7% |
| O | 319 | 3.2% |
| S | 289 | 2.9% |
| Other values (9) | 871 | 8.7% |
| Value | Count | Frequency (%) |
| 152 | ||
| | 12 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10109 | |
| None | 48 | 0.5% |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 2475 | |
| R | 1839 | |
| U | 1411 | |
| T | 759 | 7.5% |
| C | 673 | 6.7% |
| I | 551 | 5.5% |
| N | 441 | 4.4% |
| E | 365 | 3.6% |
| O | 319 | 3.2% |
| S | 289 | 2.9% |
| Other values (7) | 987 | 9.8% |
| Value | Count | Frequency (%) |
| Ã | 12 | |
| ƒ | 12 | |
| Â | 12 | |
| | 12 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4884831553 |
|---|---|
| Minimum | 0.428 |
| Maximum | 0.692 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.428 |
|---|---|
| 5-th percentile | 0.428 |
| Q1 | 0.447 |
| median | 0.447 |
| Q3 | 0.577 |
| 95-th percentile | 0.59 |
| Maximum | 0.692 |
| Range | 0.264 |
| Interquartile range (IQR) | 0.13 |
Descriptive statistics
| Standard deviation | 0.06701324192 |
|---|---|
| Coefficient of variation (CV) | 0.1371863926 |
| Kurtosis | -0.7543111025 |
| Mean | 0.4884831553 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8982394257 |
| Sum | 594.484 |
| Variance | 0.004490774593 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.447 | 650 | |
| 0.587 | 197 | 15.8% |
| 0.428 | 167 | 13.4% |
| 0.59 | 92 | 7.4% |
| 0.539 | 76 | 6.1% |
| 0.577 | 23 | 1.8% |
| 0.692 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.428 | 167 | 13.4% |
| 0.447 | 650 | |
| 0.539 | 76 | 6.1% |
| 0.577 | 23 | 1.8% |
| 0.587 | 197 | 15.8% |
| Value | Count | Frequency (%) |
| 0.692 | 12 | 1.0% |
| 0.59 | 92 | |
| 0.587 | 197 | |
| 0.577 | 23 | 1.8% |
| 0.539 | 76 | 6.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2147477403 |
|---|---|
| Minimum | 0.069 |
| Maximum | 0.474 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.069 |
|---|---|
| 5-th percentile | 0.069 |
| Q1 | 0.178 |
| median | 0.178 |
| Q3 | 0.309 |
| 95-th percentile | 0.406 |
| Maximum | 0.474 |
| Range | 0.405 |
| Interquartile range (IQR) | 0.131 |
Descriptive statistics
| Standard deviation | 0.0968158162 |
|---|---|
| Coefficient of variation (CV) | 0.450835087 |
| Kurtosis | -0.4019790726 |
| Mean | 0.2147477403 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5210713938 |
| Sum | 261.348 |
| Variance | 0.009373302267 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.178 | 650 | |
| 0.309 | 197 | 15.8% |
| 0.069 | 167 | 13.4% |
| 0.348 | 92 | 7.4% |
| 0.406 | 76 | 6.1% |
| 0.204 | 23 | 1.8% |
| 0.474 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.069 | 167 | 13.4% |
| 0.178 | 650 | |
| 0.204 | 23 | 1.8% |
| 0.309 | 197 | 15.8% |
| 0.348 | 92 | 7.4% |
| Value | Count | Frequency (%) |
| 0.474 | 12 | 1.0% |
| 0.406 | 76 | 6.1% |
| 0.348 | 92 | |
| 0.309 | 197 | |
| 0.204 | 23 | 1.8% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6754798685 |
|---|---|
| Minimum | 0.64 |
| Maximum | 0.863 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.64 |
|---|---|
| 5-th percentile | 0.64 |
| Q1 | 0.64 |
| median | 0.64 |
| Q3 | 0.716 |
| 95-th percentile | 0.748 |
| Maximum | 0.863 |
| Range | 0.223 |
| Interquartile range (IQR) | 0.076 |
Descriptive statistics
| Standard deviation | 0.04687569084 |
|---|---|
| Coefficient of variation (CV) | 0.06939613307 |
| Kurtosis | 0.893176244 |
| Mean | 0.6754798685 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.160761941 |
| Sum | 822.059 |
| Variance | 0.002197330392 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.64 | 650 | |
| 0.716 | 197 | 15.8% |
| 0.748 | 167 | 13.4% |
| 0.676 | 92 | 7.4% |
| 0.656 | 76 | 6.1% |
| 0.769 | 23 | 1.8% |
| 0.863 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.64 | 650 | |
| 0.656 | 76 | 6.1% |
| 0.676 | 92 | 7.4% |
| 0.716 | 197 | 15.8% |
| 0.748 | 167 | 13.4% |
| Value | Count | Frequency (%) |
| 0.863 | 12 | 1.0% |
| 0.769 | 23 | 1.8% |
| 0.748 | 167 | |
| 0.716 | 197 | |
| 0.676 | 92 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5744847987 |
|---|---|
| Minimum | 0.466 |
| Maximum | 0.758 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.466 |
|---|---|
| 5-th percentile | 0.466 |
| Q1 | 0.522 |
| median | 0.522 |
| Q3 | 0.736 |
| 95-th percentile | 0.745 |
| Maximum | 0.758 |
| Range | 0.292 |
| Interquartile range (IQR) | 0.214 |
Descriptive statistics
| Standard deviation | 0.1022052806 |
|---|---|
| Coefficient of variation (CV) | 0.1779077198 |
| Kurtosis | -0.9245493196 |
| Mean | 0.5744847987 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9065472424 |
| Sum | 699.148 |
| Variance | 0.01044591938 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.522 | 650 | |
| 0.736 | 197 | 15.8% |
| 0.466 | 167 | 13.4% |
| 0.745 | 92 | 7.4% |
| 0.555 | 76 | 6.1% |
| 0.758 | 23 | 1.8% |
| 0.74 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.466 | 167 | 13.4% |
| 0.522 | 650 | |
| 0.555 | 76 | 6.1% |
| 0.736 | 197 | 15.8% |
| 0.74 | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 0.758 | 23 | 1.8% |
| 0.745 | 92 | |
| 0.74 | 12 | 1.0% |
| 0.736 | 197 | |
| 0.555 | 76 | 6.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3598545604 |
|---|---|
| Minimum | 0.31 |
| Maximum | 0.536 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.31 |
|---|---|
| 5-th percentile | 0.31 |
| Q1 | 0.31 |
| median | 0.31 |
| Q3 | 0.435 |
| 95-th percentile | 0.484 |
| Maximum | 0.536 |
| Range | 0.226 |
| Interquartile range (IQR) | 0.125 |
Descriptive statistics
| Standard deviation | 0.06221978802 |
|---|---|
| Coefficient of variation (CV) | 0.1729025969 |
| Kurtosis | -0.5485352065 |
| Mean | 0.3598545604 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8783561795 |
| Sum | 437.943 |
| Variance | 0.003871302021 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.31 | 650 | |
| 0.435 | 197 | 15.8% |
| 0.362 | 167 | 13.4% |
| 0.484 | 92 | 7.4% |
| 0.385 | 76 | 6.1% |
| 0.438 | 23 | 1.8% |
| 0.536 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.31 | 650 | |
| 0.362 | 167 | 13.4% |
| 0.385 | 76 | 6.1% |
| 0.435 | 197 | 15.8% |
| 0.438 | 23 | 1.8% |
| Value | Count | Frequency (%) |
| 0.536 | 12 | 1.0% |
| 0.484 | 92 | |
| 0.438 | 23 | 1.8% |
| 0.435 | 197 | |
| 0.385 | 76 | 6.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.14605341 |
|---|---|
| Minimum | 0.105 |
| Maximum | 0.381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.105 |
|---|---|
| 5-th percentile | 0.105 |
| Q1 | 0.105 |
| median | 0.105 |
| Q3 | 0.209 |
| 95-th percentile | 0.255 |
| Maximum | 0.381 |
| Range | 0.276 |
| Interquartile range (IQR) | 0.104 |
Descriptive statistics
| Standard deviation | 0.06003480027 |
|---|---|
| Coefficient of variation (CV) | 0.4110468921 |
| Kurtosis | 0.6960624834 |
| Mean | 0.14605341 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.206138183 |
| Sum | 177.747 |
| Variance | 0.003604177244 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.105 | 650 | |
| 0.209 | 197 | 15.8% |
| 0.112 | 167 | 13.4% |
| 0.235 | 92 | 7.4% |
| 0.255 | 76 | 6.1% |
| 0.176 | 23 | 1.8% |
| 0.381 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.105 | 650 | |
| 0.112 | 167 | 13.4% |
| 0.176 | 23 | 1.8% |
| 0.209 | 197 | 15.8% |
| 0.235 | 92 | 7.4% |
| Value | Count | Frequency (%) |
| 0.381 | 12 | 1.0% |
| 0.255 | 76 | 6.1% |
| 0.235 | 92 | |
| 0.209 | 197 | |
| 0.176 | 23 | 1.8% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5160895645 |
|---|---|
| Minimum | 0.467 |
| Maximum | 0.603 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.467 |
|---|---|
| 5-th percentile | 0.467 |
| Q1 | 0.467 |
| median | 0.467 |
| Q3 | 0.576 |
| 95-th percentile | 0.595 |
| Maximum | 0.603 |
| Range | 0.136 |
| Interquartile range (IQR) | 0.109 |
Descriptive statistics
| Standard deviation | 0.05683198371 |
|---|---|
| Coefficient of variation (CV) | 0.1101203892 |
| Kurtosis | -1.770401163 |
| Mean | 0.5160895645 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.3898946447 |
| Sum | 628.081 |
| Variance | 0.003229874373 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.467 | 650 | |
| 0.576 | 197 | 15.8% |
| 0.595 | 167 | 13.4% |
| 0.579 | 92 | 7.4% |
| 0.495 | 76 | 6.1% |
| 0.59 | 23 | 1.8% |
| 0.603 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.467 | 650 | |
| 0.495 | 76 | 6.1% |
| 0.576 | 197 | 15.8% |
| 0.579 | 92 | 7.4% |
| 0.59 | 23 | 1.8% |
| Value | Count | Frequency (%) |
| 0.603 | 12 | 1.0% |
| 0.595 | 167 | |
| 0.59 | 23 | 1.8% |
| 0.579 | 92 | |
| 0.576 | 197 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4170164339 |
|---|---|
| Minimum | 0.357 |
| Maximum | 0.637 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0.357 |
|---|---|
| 5-th percentile | 0.357 |
| Q1 | 0.357 |
| median | 0.357 |
| Q3 | 0.521 |
| 95-th percentile | 0.637 |
| Maximum | 0.637 |
| Range | 0.28 |
| Interquartile range (IQR) | 0.164 |
Descriptive statistics
| Standard deviation | 0.09103202006 |
|---|---|
| Coefficient of variation (CV) | 0.2182936035 |
| Kurtosis | 0.3124216626 |
| Mean | 0.4170164339 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.319084158 |
| Sum | 507.509 |
| Variance | 0.008286828677 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.357 | 650 | |
| 0.521 | 197 | 15.8% |
| 0.379 | 167 | 13.4% |
| 0.637 | 92 | 7.4% |
| 0.406 | 76 | 6.1% |
| 0.547 | 23 | 1.8% |
| 0.624 | 12 | 1.0% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0.357 | 650 | |
| 0.379 | 167 | 13.4% |
| 0.406 | 76 | 6.1% |
| 0.521 | 197 | 15.8% |
| 0.547 | 23 | 1.8% |
| Value | Count | Frequency (%) |
| 0.637 | 92 | |
| 0.624 | 12 | 1.0% |
| 0.547 | 23 | 1.8% |
| 0.521 | 197 | |
| 0.406 | 76 | 6.1% |
| Distinct | 89 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 157.5538209 |
|---|---|
| Minimum | 0 |
| Maximum | 950 |
| Zeros | 39 |
| Zeros (%) | 3.1% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 47 |
| median | 128 |
| Q3 | 220 |
| 95-th percentile | 329 |
| Maximum | 950 |
| Range | 950 |
| Interquartile range (IQR) | 173 |
Descriptive statistics
| Standard deviation | 163.2830846 |
|---|---|
| Coefficient of variation (CV) | 1.036363852 |
| Kurtosis | 11.50953343 |
| Mean | 157.5538209 |
| Median Absolute Deviation (MAD) | 84 |
| Skewness | 2.883429243 |
| Sum | 191743 |
| Variance | 26661.36573 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 138 | 61 | 4.9% |
| 258 | 56 | 4.5% |
| 114 | 49 | 3.9% |
| 0 | 39 | 3.1% |
| 170 | 39 | 3.1% |
| 207 | 35 | 2.8% |
| 147 | 32 | 2.6% |
| 180 | 31 | 2.5% |
| 128 | 31 | 2.5% |
| 950 | 31 | 2.5% |
| Other values (79) | 813 |
| Value | Count | Frequency (%) |
| 0 | 39 | |
| 1 | 3 | 0.2% |
| 2 | 17 | |
| 3 | 20 | |
| 4 | 16 |
| Value | Count | Frequency (%) |
| 950 | 31 | |
| 358 | 17 | |
| 338 | 12 | 1.0% |
| 329 | 28 | |
| 328 | 22 |
| Distinct | 91 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 178.1355793 |
|---|---|
| Minimum | 0 |
| Maximum | 985 |
| Zeros | 42 |
| Zeros (%) | 3.4% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.8 |
| Q1 | 52 |
| median | 138 |
| Q3 | 265 |
| 95-th percentile | 396 |
| Maximum | 985 |
| Range | 985 |
| Interquartile range (IQR) | 213 |
Descriptive statistics
| Standard deviation | 175.5819053 |
|---|---|
| Coefficient of variation (CV) | 0.9856644358 |
| Kurtosis | 8.829622458 |
| Mean | 178.1355793 |
| Median Absolute Deviation (MAD) | 91 |
| Skewness | 2.460983796 |
| Sum | 216791 |
| Variance | 30829.00545 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 296 | 56 | 4.5% |
| 138 | 52 | 4.2% |
| 0 | 42 | 3.4% |
| 178 | 39 | 3.1% |
| 182 | 38 | 3.1% |
| 228 | 35 | 2.8% |
| 170 | 32 | 2.6% |
| 985 | 31 | 2.5% |
| 140 | 31 | 2.5% |
| 210 | 31 | 2.5% |
| Other values (81) | 830 |
| Value | Count | Frequency (%) |
| 0 | 42 | |
| 1 | 13 | 1.0% |
| 2 | 6 | 0.5% |
| 3 | 17 | |
| 4 | 14 | 1.1% |
| Value | Count | Frequency (%) |
| 985 | 31 | |
| 400 | 12 | 1.0% |
| 397 | 17 | |
| 396 | 27 | |
| 378 | 22 |
| Distinct | 100 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 335.6894002 |
|---|---|
| Minimum | 0 |
| Maximum | 1935 |
| Zeros | 39 |
| Zeros (%) | 3.1% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 99 |
| median | 260 |
| Q3 | 495 |
| 95-th percentile | 711 |
| Maximum | 1935 |
| Range | 1935 |
| Interquartile range (IQR) | 396 |
Descriptive statistics
| Standard deviation | 338.4006484 |
|---|---|
| Coefficient of variation (CV) | 1.008076657 |
| Kurtosis | 10.12865754 |
| Mean | 335.6894002 |
| Median Absolute Deviation (MAD) | 175 |
| Skewness | 2.666860937 |
| Sum | 408534 |
| Variance | 114514.9988 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 320 | 56 | 4.5% |
| 554 | 56 | 4.5% |
| 252 | 49 | 3.9% |
| 215 | 49 | 3.9% |
| 0 | 39 | 3.1% |
| 348 | 39 | 3.1% |
| 435 | 35 | 2.8% |
| 317 | 32 | 2.6% |
| 390 | 31 | 2.5% |
| 1935 | 31 | 2.5% |
| Other values (90) | 800 |
| Value | Count | Frequency (%) |
| 0 | 39 | |
| 1 | 3 | 0.2% |
| 3 | 13 | 1.0% |
| 4 | 3 | 0.2% |
| 5 | 4 | 0.3% |
| Value | Count | Frequency (%) |
| 1935 | 31 | |
| 755 | 17 | |
| 738 | 12 | 1.0% |
| 711 | 27 | |
| 706 | 22 |
| Distinct | 78 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.06244864 |
|---|---|
| Minimum | 0 |
| Maximum | 552 |
| Zeros | 39 |
| Zeros (%) | 3.1% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 25 |
| median | 75 |
| Q3 | 132 |
| 95-th percentile | 203 |
| Maximum | 552 |
| Range | 552 |
| Interquartile range (IQR) | 107 |
Descriptive statistics
| Standard deviation | 96.49227562 |
|---|---|
| Coefficient of variation (CV) | 1.025832062 |
| Kurtosis | 10.32635702 |
| Mean | 94.06244864 |
| Median Absolute Deviation (MAD) | 52 |
| Skewness | 2.69695491 |
| Sum | 114474 |
| Variance | 9310.759255 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 168 | 56 | 4.5% |
| 75 | 49 | 3.9% |
| 100 | 39 | 3.1% |
| 0 | 39 | 3.1% |
| 80 | 38 | 3.1% |
| 87 | 37 | 3.0% |
| 1 | 36 | 2.9% |
| 132 | 35 | 2.8% |
| 106 | 31 | 2.5% |
| 82 | 31 | 2.5% |
| Other values (68) | 826 |
| Value | Count | Frequency (%) |
| 0 | 39 | |
| 1 | 36 | |
| 2 | 20 | |
| 3 | 18 | |
| 4 | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 552 | 31 | |
| 204 | 17 | |
| 203 | 22 | |
| 200 | 27 | |
| 199 | 12 | 1.0% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.001731306491 |
|---|---|
| Minimum | 0 |
| Maximum | 0.005 |
| Zeros | 428 |
| Zeros (%) | 34.4% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.002 |
| Q3 | 0.003 |
| 95-th percentile | 0.005 |
| Maximum | 0.005 |
| Range | 0.005 |
| Interquartile range (IQR) | 0.003 |
Descriptive statistics
| Standard deviation | 0.001654125928 |
|---|---|
| Coefficient of variation (CV) | 0.9554206236 |
| Kurtosis | -0.8260440024 |
| Mean | 0.001731306491 |
| Median Absolute Deviation (MAD) | 0.002 |
| Skewness | 0.5704926284 |
| Sum | 2.107 |
| Variance | 2.736132584 × 106 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 428 | |
| 0.002 | 308 | |
| 0.001 | 143 | 11.5% |
| 0.003 | 114 | 9.2% |
| 0.004 | 114 | 9.2% |
| 0.005 | 110 | 8.8% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 428 | |
| 0.001 | 143 | 11.5% |
| 0.002 | 308 | |
| 0.003 | 114 | 9.2% |
| 0.004 | 114 | 9.2% |
| Value | Count | Frequency (%) |
| 0.005 | 110 | 8.8% |
| 0.004 | 114 | 9.2% |
| 0.003 | 114 | 9.2% |
| 0.002 | 308 | |
| 0.001 | 143 |
| Distinct | 123 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.006090754451 |
|---|---|
| Minimum | 0 |
| Maximum | 0.01883971406 |
| Zeros | 39 |
| Zeros (%) | 3.1% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.195905554 × 105 |
| Q1 | 0.0005228866384 |
| median | 0.005365302324 |
| Q3 | 0.009731768111 |
| 95-th percentile | 0.01761695235 |
| Maximum | 0.01883971406 |
| Range | 0.01883971406 |
| Interquartile range (IQR) | 0.009208881473 |
Descriptive statistics
| Standard deviation | 0.005782344845 |
|---|---|
| Coefficient of variation (CV) | 0.9493643016 |
| Kurtosis | -0.6863856692 |
| Mean | 0.006090754451 |
| Median Absolute Deviation (MAD) | 0.004842415686 |
| Skewness | 0.6872989243 |
| Sum | 7.412448167 |
| Variance | 3.343551191 × 105 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01382216808 | 56 | 4.5% |
| 0.006288656582 | 49 | 3.9% |
| 0 | 39 | 3.1% |
| 0.008682485957 | 39 | 3.1% |
| 0.00798503661 | 38 | 3.1% |
| 0.01085464337 | 35 | 2.8% |
| 0.007910178795 | 32 | 2.6% |
| 0.001931432523 | 31 | 2.5% |
| 0.009731768111 | 31 | 2.5% |
| 0.00668712508 | 31 | 2.5% |
| Other values (113) | 836 |
| Value | Count | Frequency (%) |
| 0 | 39 | |
| 9.98118972 × 107 | 2 | 0.2% |
| 2.994613106 × 106 | 3 | 0.2% |
| 7.98480544 × 106 | 1 | 0.1% |
| 9.981296241 × 106 | 4 | 0.3% |
| Value | Count | Frequency (%) |
| 0.01883971406 | 17 | |
| 0.0184155705 | 12 | |
| 0.0177418051 | 27 | |
| 0.01761695235 | 22 | |
| 0.01759199515 | 28 |
ocup_2010
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1245 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 1 | 816 | |
| 0 | 429 |
| Value | Count | Frequency (%) |
| 1 | 816 | |
| 0 | 429 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 816 | |
| 0 | 429 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1245 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 816 | |
| 0 | 429 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1245 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 816 | |
| 0 | 429 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1245 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 816 | |
| 0 | 429 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 157296.1566 |
|---|---|
| Minimum | 11813 |
| Maximum | 279589 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 11813 |
|---|---|
| 5-th percentile | 13964 |
| Q1 | 17961 |
| median | 279589 |
| Q3 | 279589 |
| 95-th percentile | 279589 |
| Maximum | 279589 |
| Range | 267776 |
| Interquartile range (IQR) | 261628 |
Descriptive statistics
| Standard deviation | 128823.6891 |
|---|---|
| Coefficient of variation (CV) | 0.8189881551 |
| Kurtosis | -1.984225553 |
| Mean | 157296.1566 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.1072431603 |
| Sum | 195833715 |
| Variance | 1.659554288 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 279589 | 654 | |
| 16957 | 197 | 15.8% |
| 34125 | 182 | 14.6% |
| 13964 | 92 | 7.4% |
| 17961 | 85 | 6.8% |
| 20787 | 23 | 1.8% |
| 11813 | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 11813 | 12 | 1.0% |
| 13964 | 92 | |
| 16957 | 197 | |
| 17961 | 85 | |
| 20787 | 23 | 1.8% |
| Value | Count | Frequency (%) |
| 279589 | 654 | |
| 34125 | 182 | 14.6% |
| 20787 | 23 | 1.8% |
| 17961 | 85 | 6.8% |
| 16957 | 197 | 15.8% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21173.99277 |
|---|---|
| Minimum | 1429 |
| Maximum | 35323 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 1429 |
|---|---|
| 5-th percentile | 1429 |
| Q1 | 5722 |
| median | 35323 |
| Q3 | 35323 |
| 95-th percentile | 35323 |
| Maximum | 35323 |
| Range | 33894 |
| Interquartile range (IQR) | 29601 |
Descriptive statistics
| Standard deviation | 15090.51712 |
|---|---|
| Coefficient of variation (CV) | 0.7126911437 |
| Kurtosis | -1.883488264 |
| Mean | 21173.99277 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.176789585 |
| Sum | 26361621 |
| Variance | 227723707.1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 35323 | 654 | |
| 5722 | 197 | 15.8% |
| 1429 | 182 | 14.6% |
| 11681 | 92 | 7.4% |
| 6942 | 85 | 6.8% |
| 2603 | 23 | 1.8% |
| 12373 | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 1429 | 182 | |
| 2603 | 23 | 1.8% |
| 5722 | 197 | |
| 6942 | 85 | |
| 11681 | 92 |
| Value | Count | Frequency (%) |
| 35323 | 654 | |
| 12373 | 12 | 1.0% |
| 11681 | 92 | 7.4% |
| 6942 | 85 | 6.8% |
| 5722 | 197 | 15.8% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 178470.1494 |
|---|---|
| Minimum | 22679 |
| Maximum | 314912 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 22679 |
|---|---|
| 5-th percentile | 22679 |
| Q1 | 24903 |
| median | 314912 |
| Q3 | 314912 |
| 95-th percentile | 314912 |
| Maximum | 314912 |
| Range | 292233 |
| Interquartile range (IQR) | 290009 |
Descriptive statistics
| Standard deviation | 143637.4953 |
|---|---|
| Coefficient of variation (CV) | 0.804826442 |
| Kurtosis | -1.989776182 |
| Mean | 178470.1494 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.1035223407 |
| Sum | 222195336 |
| Variance | 2.063173007 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 314912 | 654 | |
| 22679 | 197 | 15.8% |
| 35554 | 182 | 14.6% |
| 25645 | 92 | 7.4% |
| 24903 | 85 | 6.8% |
| 23390 | 23 | 1.8% |
| 24186 | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 22679 | 197 | |
| 23390 | 23 | 1.8% |
| 24186 | 12 | 1.0% |
| 24903 | 85 | |
| 25645 | 92 |
| Value | Count | Frequency (%) |
| 314912 | 654 | |
| 35554 | 182 | 14.6% |
| 25645 | 92 | 7.4% |
| 24903 | 85 | 6.8% |
| 24186 | 12 | 1.0% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5837.75743 |
|---|---|
| Minimum | 231 |
| Maximum | 9281 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 231 |
|---|---|
| 5-th percentile | 346 |
| Q1 | 1972 |
| median | 9281 |
| Q3 | 9281 |
| 95-th percentile | 9281 |
| Maximum | 9281 |
| Range | 9050 |
| Interquartile range (IQR) | 7309 |
Descriptive statistics
| Standard deviation | 3769.554093 |
|---|---|
| Coefficient of variation (CV) | 0.6457195487 |
| Kurtosis | -1.695857714 |
| Mean | 5837.75743 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.3099112155 |
| Sum | 7268008 |
| Variance | 14209538.06 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 9281 | 654 | |
| 1972 | 197 | 15.8% |
| 346 | 182 | 14.6% |
| 3681 | 92 | 7.4% |
| 4488 | 85 | 6.8% |
| 1038 | 23 | 1.8% |
| 231 | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 231 | 12 | 1.0% |
| 346 | 182 | |
| 1038 | 23 | 1.8% |
| 1972 | 197 | |
| 3681 | 92 |
| Value | Count | Frequency (%) |
| 9281 | 654 | |
| 4488 | 85 | 6.8% |
| 3681 | 92 | 7.4% |
| 1972 | 197 | 15.8% |
| 1038 | 23 | 1.8% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 KiB |
| 5105.25 | |
|---|---|
| 2395.28 | |
| 8363.26 | 12 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 8715 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2395.28 |
|---|---|
| 2nd row | 2395.28 |
| 3rd row | 2395.28 |
| 4th row | 2395.28 |
| 5th row | 2395.28 |
| Value | Count | Frequency (%) |
| 5105.25 | 1033 | |
| 2395.28 | 200 | 16.1% |
| 8363.26 | 12 | 1.0% |
| Value | Count | Frequency (%) |
| 5105.25 | 1033 | |
| 2395.28 | 200 | 16.1% |
| 8363.26 | 12 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 3299 | |
| 2 | 1445 | |
| . | 1245 | 14.3% |
| 1 | 1033 | 11.9% |
| 0 | 1033 | 11.9% |
| 3 | 224 | 2.6% |
| 8 | 212 | 2.4% |
| 9 | 200 | 2.3% |
| 6 | 24 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7470 | |
| Other Punctuation | 1245 | 14.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 5 | 3299 | |
| 2 | 1445 | |
| 1 | 1033 | 13.8% |
| 0 | 1033 | 13.8% |
| 3 | 224 | 3.0% |
| 8 | 212 | 2.8% |
| 9 | 200 | 2.7% |
| 6 | 24 | 0.3% |
| Value | Count | Frequency (%) |
| . | 1245 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8715 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 5 | 3299 | |
| 2 | 1445 | |
| . | 1245 | 14.3% |
| 1 | 1033 | 11.9% |
| 0 | 1033 | 11.9% |
| 3 | 224 | 2.6% |
| 8 | 212 | 2.4% |
| 9 | 200 | 2.3% |
| 6 | 24 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8715 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 5 | 3299 | |
| 2 | 1445 | |
| . | 1245 | 14.3% |
| 1 | 1033 | 11.9% |
| 0 | 1033 | 11.9% |
| 3 | 224 | 2.6% |
| 8 | 212 | 2.4% |
| 9 | 200 | 2.3% |
| 6 | 24 | 0.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 28 |
| Missing (%) | 2.2% |
| Memory size | 9.9 KiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3651 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
| Value | Count | Frequency (%) |
| 1.0 | 930 | |
| 0.0 | 287 | 23.1% |
| (Missing) | 28 | 2.2% |
| Value | Count | Frequency (%) |
| 1.0 | 930 | |
| 0.0 | 287 | 23.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| . | 1217 | |
| 1 | 930 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2434 | |
| Other Punctuation | 1217 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| 1 | 930 |
| Value | Count | Frequency (%) |
| . | 1217 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3651 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| . | 1217 | |
| 1 | 930 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3651 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| . | 1217 | |
| 1 | 930 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Unnamed: 0 | id.x | NM_MU | IVS_2000 | IVS_INF_00 | IVS_CPH_00 | IVS_REN_00 | IVS_2010 | IVS_INF_10 | IVS_CPH_10 | IVS_REN_10 | MASC | FEM | POP | DOM_OCU | Dens_Dom | Dens_hab | ocup_2010 | Pop_Urbana | Pop_Rural | Pop_Total | Area | VTN_MED | URB_RURAL | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | F1810 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 79.0 | 81.0 | 160.0 | 48.0 | 0.001 | 0.003991 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 1 | 2 | F1811 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 79.0 | 81.0 | 160.0 | 48.0 | 0.001 | 0.003991 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 2 | 3 | F1811 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 79.0 | 81.0 | 160.0 | 48.0 | 0.001 | 0.003991 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 3 | 4 | F1805 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 79.0 | 81.0 | 160.0 | 48.0 | 0.001 | 0.003991 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 4 | 5 | F1804 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 79.0 | 81.0 | 160.0 | 48.0 | 0.001 | 0.003991 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 5 | 6 | F1809 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 79.0 | 81.0 | 160.0 | 48.0 | 0.001 | 0.003991 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 6 | 7 | F1802 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 50.0 | 45.0 | 95.0 | 25.0 | 0.001 | 0.002370 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 7 | 8 | F1803 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 50.0 | 45.0 | 95.0 | 25.0 | 0.001 | 0.002370 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 8 | 9 | F1471 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000 | 0.000000 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
| 9 | 10 | F1793 | TAQUARITINGA DO NORTE | 0.539 | 0.406 | 0.656 | 0.555 | 0.385 | 0.255 | 0.495 | 0.406 | 0.0 | 0.0 | 0.0 | 0.0 | 0.000 | 0.000000 | 0 | 17961 | 6942 | 24903 | 4488 | 2395.28 | 1.0 |
Last rows
| Unnamed: 0 | id.x | NM_MU | IVS_2000 | IVS_INF_00 | IVS_CPH_00 | IVS_REN_00 | IVS_2010 | IVS_INF_10 | IVS_CPH_10 | IVS_REN_10 | MASC | FEM | POP | DOM_OCU | Dens_Dom | Dens_hab | ocup_2010 | Pop_Urbana | Pop_Rural | Pop_Total | Area | VTN_MED | URB_RURAL | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1235 | 1236 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1236 | 1237 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1237 | 1238 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1238 | 1239 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1239 | 1240 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1240 | 1241 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1241 | 1242 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1242 | 1243 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1243 | 1244 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |
| 1244 | 1245 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 34125 | 1429 | 35554 | 346 | 5105.25 | NaN |